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Abstract. 

In this note we revisit E. Cartan's address at the 1928 International Congress of 
Mathematicians at Bologna, Italy. The distributions considered here will be of the same 
class as those considered by Cartan, a special type which we call strongly non-holonomic. 
We set up the groundwork for using Cartan's method of equivalence (a powerful tool for 
obtaining invariants associated to geometrical objects), to more general non-holonomic 
distributions. 
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Le vrai probleme de la representation geometrique d'un systeme 
materiel non holonome consiste [••■] dans la recherche d'un 
schema geometrique lie d'une maniere invariante aux proprietes 
mecaniques du systeme. Elie Cartan 

1. Introduction 

In this note we revisit E. Cartan's address [I] at the 1928 International Congress 
of Mathematicians at Bologna, Italy. The distributions considered by Cartan were of a 
special type which we call strongly non-holonomic. Our aim is to set up the groundwork for 
using Cartan's method of equivalence (a powerful tool for obtaining invariants associated 
to geometrical objects, [2]), to more general non-holonomic distributions. 

This is a local study, but we outline some global aspects. If the configuration space Q 
is a manifold of dimension n, its tangent bundle TQ should admit a smooth subbundle 
E of dimension m, m < n. As it is well known, this imposes topological constraints on 
Q, see [3J. Although we will be discussing only local invariants, hopefully these will help 
constructing global ones, such as special representations for the characteristic classes [I], 
0- 

Notation. Throughout this paper we follow consistently the following convention: cap- 
ital roman letters /, J, K, etc. run from 1 to n. Lower case roman characters i, j, k run 
from 1 to m (representing the constraint distribution). Greek characters a, j3, 7, etc., run 
from m + 1 to n. Summation over repeated indices is assumed unless otherwise stated. 

2. Non-holonomic connections 

We fix a Riemannian metric g on Q and let V the associated Riemannian connection 
(torsion free and metric preserving: 

(1) VxY - V Y X = [X, Y] , X(Y, Z) = (V X Y, Z) + (Y, V X Z) ). 

In section [S] we consider an arbitrary affine connection (see [9]) on Q. Recall that given 
a local frame ei on V C Q and its dual coframe uj, a connection V is described by local 
1-forms Uu = —ujji such that 

(2) V x ej = u LJ {X) ej . 

The torsion tensor is T(X, Y) = V X Y - V y X - [X, Y\ = t^X, Y) e 7 and expanding the 
left hand side we get the structure equations 

(3) duj + ujj A Uj — 1 1 . 
As the Riemannian connection V is torsion free: tj = 0. 

We assume heretofore that the frame is adapted to the distribution E. This means 
{ei(q)} span the subspace E q , q e Q, and the remaining {e a } span the ^-orthogonal space 
F = E ± 

Definition 2.1. The (Levi-Civita) non-holonomic connection on E is defined by the rule 

(4) D x Cj = u)ij{X)e h = 1, ...,m) . 
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Here we allow X to be any vectorfield on Q, not necessarily tangent to E. Notice that 
for vectorfields Y, Z tangent to E, the metric-compatibility 

(5) X(Y, Z) = (D X Y, Z) + (Y, D X Z) 
still holds. 

The motivation for this definition is D'Alembert's principle: Consider a mechanical 
system with kinetic energy hg(c, c) and applied forces F, subject to constraints such that 
c G E c (ty The constraining force VcC — F is g -perpendicular to the constraint subspace 
E c (t), since it does not produce work. 

Unless otherwise mentioned, we assume there are no applied forces. The geodesic 
equations are given, in Cartan's approach, by 

(6) T = , D T T =[J£. + pjUijiT) ] e* = . 

One may wish to see the equations explicitly. Choose a coordinate system x on V C Q. 
Define m 3 functions T l - k {x) on V by 

(7) D ekej = r jkei . 

Write 

(8) c(t) = p k (t)e k 

(some authors call the p k = u k (c(t)) quasivelocities) . The equation D t c = gives a 
nonlinear system in n + m dimensions for x and p given by 

-jj- = Pk e R {x), — = -p iPk T J ik {x) . 

Here e k R is the i?-th component (1 < R < n) of the k-th S-basis vector (1 < k < m) in 
terms of the chosen trivialization of TQ. 

3. Geometric interpretation. 

Direct and inverse "development" of frames and curves were so obvious to Cartan 
that he did not bother to give details. We elaborate these concepts, exhibiting explicitly 
(Theorem 13.41 below) a system of ODEs producing at the same time, the solution of 
the non-holonomic system, a parallel frame along it, and a representation of the non- 
holonomic systems on the euclidian space R m , the hodograph of frames to R m . 

3.1. Direct parallel transport of a .E^-frame. A frame for E qo C T qo Q can be 
transported along a curve c(t) in Q. The "novelty" here (as stressed by Cartan): c(t) is 
an arbitrary curve in Q, that is c(t) does not need to be tangent to E. 

Recall that given a tangent vector V° = v° e-j G E qo and a curve c{t) G Q,c(0) = q , 
there is a unique vectorfield V(t) G E c m , V(0) = V° such that D d V(t) = . In fact, we 
are led to the linear time-dependent system of ODEs 

(9) 

( using D 6 ej = -u jk (c)e k and © )• 

In particular, an orthonormal frame at E Qo is transported to E c m and remains orthonor- 
mal. 



V1 > 
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3.2. Hodograph of a E^-frame to R m . Given a curve of frames for TQ along c(t), 
{ej(t)}, consider the frame for E c ^ formed by the first m vectors e^. We have 

V £ ei = -uJij(c) ej - u ia (c) e a 

We develop a "mirror" or hodograph frame {U(t) : Ui(t), • • • , u m (t)} confined to R m = 
E qo) q Q = c(0), solving the system for U(t) E 0(m) given by 

(10) Ui = -u ik (c(t))uk , Ui(0) = ei(q ) . 
Equivalently (by elementary matrix algebra) 

(11) U- 1 U = (cjij c(t)) , £7(0) = Id 
where Ui are the columns of £7. 

Lemma 3.1. Let {£7(0) : itj(0) = e,(0)} a frame for E Qo . The hodograph of its direct 
paralel transport {ei(t), ■ • • , e m (t)} along c(t) is the constant frame U(t) = £7(0) = Id . 

Proof. This is because D^e^t) = iff Uij(c(t)) = 0. 

Proposition 3.2. Let U{t) a curve of frames in M m . Define a frame {ii(t), ■ ■ ■ , e m (t)} for E c ^ 
by 

(12) (ei, • • • ,e m ) = (ei, • • • , e m ) U 
where {ei, • • • , e m } is parallel along c(t). Then 

i) ei, • • • ,e m satisfies D^j = —ujjiti with (wjj) = £7 _1 £7. 

ii) The hodograph of e±, - • • ,e m to R m is U (t) . 

Proof. It sufficies to prove i) and it is simple: 

D & (e u ■ ■ ■ ,e m ) = (ei, • • • ,e m ) £7 + (D 6 ei, ■ ■ ■ ,D 6 e m ) U 

= (ei, • • • , e m ) U = (ei, • • • , e m ) U U^ 1 U = (e u • • • , e m )(uij) 

3.3. Hodograph to R m of a curve c(t) in Q. Consider c(t), a curve in Q, c(0) = q . 
As before, it is not assumed that c(t) is tangent to E c m. Let {e/ : ei(t),--- ,e n (t)} a 
local orthonormal frame for T q Q along c(t) with {ei,-- - ,e m } tangent to E q . Denote 
uj,I = 1, • • • ,n the dual basis. Construct first the hodograph {ui(t),--- ,u m (t)} of 
{ei, • • • , e m } to M m , and then define 

(13) 7 (t) = je = [ujxifyu^t) + ■■■ + u m {t)u m {t)] dt . 

where (9 is an 1-form with values in R m given by 

(14) 9 = 

and for short we wrote Ui(t) = u>i(c(t)). 

The curve -y(t) in M. m = E qo , is called the hodograph of c(t) to R m . If e^t), ■ ■ ■ ,e m (t) 
are parallel along c, then 7(f) is given by 

7(t) = ( / uidt,-- - , u m dt) 
Jo Jo 

taking the coordinate axis of R m along -ui(O) = • • • , u m (0) = u m (t). 
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3.4. Development on Q of a curve ^(t) G R m = E Qo . On the other direction, 

Proposition 3.3. Given a curve 7(t) in KL m = E qo , we can construct a unique curve c(t) 
in Q tangent to E whose hodograph is j{t). The curve c(t) is called the development of 
lit)- 

Proof. First, extend an ^-adapted basis for T q Q in a neighborhood q G V C Q, with 
corresponding forms oj/j, I, J — 1, ■ ■ ■ ,n. Then consider the vectorfield in V x 0(m) 
given by 

771 

(15) X = {U- 1 j(t)). e, , U = U(u tJ (X)) , c(0) = q, U(0) = id . 

i=i 

Integrating this vectorfield we obtain a curve (c(t), U(t)). 

We claim that the hodograph of c(t) is j(t) = 7(t) (the vectorfield was constructed 
precisely for that purpose). Indeed, by the previous item, 

(16) 7 (i) = / [tZ-^Wli «!+••• + [U'^{t)} m u m (t)} alt . 

Jo 

so that 

^=[U-^(t)] iUl 

which is equal to 7 by elementary linear algebra: if v is any vector and U any invert- 
ible matrix, v = (U~ 1 v)iUi, where Ui are the columns of U (U is the matrix changing 
coordinates from the basis to the canonical basis). 

What if we had used a different frame ej on Ul We would get a system of ODEs 
in 

(17) Y = {T- 1 Ht)), Ik , f = T(p t3 (Y)) , c(0) = q, T(0) = id 

i=l 

and we claim that Y = X so the curve c(t), is unique. 

To prove this fact it equivalent to show that T = UP where P changes basis from 
ei,i = 1, ■ ■ ■ , m to ej, i — 1, • • • , m , that is 

(18) (ei, • • ■ ,e m ) = (ei, • • ■ , e m ) P . 
We compute 

t- x t = (UPy 1 {UP + UP) 

= p~ x P + p- x {\j- x ij)p = p~ l P + p- l {uij)p 

which is indeed the gauge-theoretical rule giving the forms (pij) of the basis defining 
T from the forms (wy) of the basis e,. 

We can upgrade this construction to provide a parallel frame along c(t), by declaring 
= 0. This gives 

(19) P+(u ij (X))P = . 



which could be added to system (U51) . Actually, we can take the equation for U out of 
that system, observing that U = P _1 . 

(Proof: If/" 1 = -U^UU- 1 = -(cu^lf- 1 ). 
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Theorem 3.4. Given a curve j(t) G M m , consider the nonautonomous system ODEs in 
the manifold Fr(E) given by 

m 

(20) X = J2 (PiWi ei,P= {-^{X))P , c(0) = q, P(0) = id . 

i=i 

It gives the developed curve c(t) on Q and an attached parallel frame 

(21) (ei, ■ • • ,e m ) = (ex, ■ ■ ■ ,e m ) P . 

For a line j(t) = tv passing through the origin in M. m we obtain the non-holonomic 
geodesic starting at q with velocity c(0) = v. 

3.5. Hodograph of the D'Alembert-Lagrange equation. We elaborate on the 
comments of §7 in [10 Consider a mechanical system with kinetic energy T and external 
forces F (in contravariant form), subject to constraints defined by the distribution E. 
The non-holonomic dynamics is given by 

(22) D t c = F 11 

where the right hand side is the ortogonal projection of F over E. 

Let 7 be the hodograph of c. Fix a constant frame /]_,•■• , f m on K m and write 

7(*) = X>WA ■ 

i 

Let ej be the parallel frame along c(t) obtained in Theorem 13.41 Decompose 

(23) F" = /ie< . 
Corollary 3.5. (Cartan, [I], §7). Equation ^EE) is equivalent to 

(24) %(t) = mt)) . 

Equations (2$ should be solved simultaneously with j[W\) . 

This approach can be helpful for setting up numerical methods, and in some cases 
reducing the non-holonomic system to a second order equation on R m . We also observe 
that F can represent non-holonomic control forces actuating over the system, as those 
studied in [S]. 

4. Equivalent connections. 

In this section and the next we discuss the question of whether two non-holonomic 
connections D and D on E have the same geodesies. 

Given A 6 GL(n — m),C G 0(m), B G M(m, n — to), we take 

(25) ZJ a = A a \U\ , Ui = CikWk + Bi\U\ 

This is the most general change of coframes preserving the sub-Riemannian metric 

(26) g sub — \-uJ 2 rn 

supported on E. The corresponding dual frame e/ satisfies 

(27) 6^ CjCjj , 6q, e%B{ a -)- e\A\ a 



la La trajectoire du systeme materiel, suppose soumis a des forces donnees de travail elementaire 
^2 PjpJi, se d'eveloppe suivant la trajectoire d'un point materiel de masse 1 place dans l'espace euclidien 
a m dimensions et soumis a la force de composantes Pi." 
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(here, for ease of notation we place scalars after vectors). 
In matrix form, we have 

(28) 





( c 


B x 








57 = 1 


v o 




) w . 















(29) (ei, • • • ,e n ) = (ei, ■ ■■ ,e n ) 



C B 
A 



Using matrix notation is not only convenient for the calculations, but also to set up the 
equivalence problem ([2]). Consider the linear group G of matrices of the form 

(30) 9=(q a) ' ° E °( m )' AG GL(n-m),B e M(m,n-m). 

The equivalence problem for sub-Riemannian geometry can be described as follows: 
Given coframes fiy = (f2y, ■ • • , f2y)* and uu = (w^, • • • , oJjjf on open sets U and V, 
find invariants characterizing the existence of a diffeomorphism F : hi — > V satisfying 
F*Oy = 9 ■ wu- F° r this study, see Montgomery [TO] . 

In non-holonomic geometry we are lead to a more difficult equivalence problem (see 
section below). In [TJ, Cartan characterized the non-holonomic connections only for 
a certain type of distributions, which we will call strongly non-holonomic. Interestingly, 
Cartan did not work out the associated invariants, even in this case. He focused in finding 
a special representative in the equivalence class of connections with the same geodesies. 

Consider the modified metric on Q 

(31) g = ulf + ■■■ + uJ^ 2 

and the associated Levi-Civita connection D. The geodesic equation is 

— ds- 

(32) T = Sj e 3 , D T T =[~^ + s k u jk {T) ] e d = . 

To compare (152"]) and flBJ), there is no loss in generality by taking C = id. By inspection 
one gets: 

Proposition 4.1. (Cartan, [TJ, §5.) Fix C = id. The geodesies of D and D are the same 
iff 

(33) u ij {T) = uj ij {T) 
for all T tangent to E. 

5. Pfaffian systems and Lie algebras of vectorfields. 

5.1. Equivalent 1-forms. In view of ( 133]) it seems useful to introduce the following 

Definition 5.1. Two 1-forms oj\ and U2 are i?-equivalent if u± — 002 anihilates E. We 
write uii ~eU2 or simply lo\ ~ uj 2 . 

In the C°° (<5)-ring of differential forms A*(Q), consider the ideal X generated by the 
1-forms u a , a — m + 1, • • ■ ,n. We can write 

(34) E = l ± 

Clearly ui ~£ u 2 is equivalent to u\ — u 2 = Yl fa u a G X. More generally, two k- forms 
uj\ and 0J2 are said to be E'-equivalent if their difference vanishes when one of the slots 
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(vi, • • • ,Vk) is taken on E. Again, this means that u>\ — o> 2 G X. (In fact, given a Pfaffian 
system of 1-forms on Q 

6\ = 0, • • • , 9 r = 0, 

one can form the ideal X on A*(Q) generated by these forms. Every form that is annulled 
by the solutions of the system belongs to X, see [6] p. 232). 

If ui ~£ U2 it does not necessarily follow that du\ ~£ dw<i. For the later to happen, 
the former must be equivalent over a larger subspace, (X 1 )- 1 D E which we now describe. 

5.2. Filtrations in TQ and in T*Q. Let X a Pfaffian system. 
Definition 5.2. The derived system D{X) is 

(35) =D{i) = \e = Y J a l e i 

I i=l 

One constructs (see [7J) the decreasing filtration 

■ ■ • c x< 2 ) c x« c x(°) 

defined inductively by X( fc+1 ) = (X^) 1 . 

Here X is thought as a submodule over C°°(Q) consisting of all 1-forms generated by 
the 8{. We assume all have constant rank. The filtration eventually stabilizes after a finite 
number inclusions, and we denote this space X final . By Frobenius theorem, the Pfaffian 
system X final is integrable. Fix a leaf S and consider the pull back of the filtration. That 
is, we pull back all forms by the inclusion j : S — >■ Q. The filtration associated to j*X 
stabilizes at zero. 

There is a dual viewpoint, more commonly used in non-holonomic control theory ([12]): 
given a distribution E in TQ one considers an increasing filtration 

E = E C E 1 C E 2 C • • • 

Two (different) options are used: 

1) Ei = E^i + Ei-i] . 

2) Ei = Ei-i + [E Q , Ei-i] = Ei-i + J2j+k=i-i[Eji ■£*] • 
Lemma 5.3. C (so E 1 C (XW) X ). 

Proof. Let X, Y, Z G We want to show that for any 9 G X\ 

9(X+[Y,Z]) = . 
Well, 9(X) = by default and (the correct signs do not matter) 

9[Y, Z] = ±d9(Y, Z) ± Z9(Y) ± Y9(Z) = 
as 9{Y) = 9{Z) = because 5 G X and ^(F, Z) = because 9 G X x . 

6. Main result. 

6.1. Strongly non-holonomic distributions. The main question to be addressed 
in the local theory is the following. Assume that the geodesies of two Levi-Civita non- 
holonomic connections D and D are the same. Proposition 14.11 says that a necessary and 
sufficient condition for this to happen is uJjj ~£ wy. 



d6 el 



X 
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What are the implications of this condition in terms of the original coframes u = 
(u>i, ■ ■ ■ , uj n ) and u = (aJi, • • ■ , aJ n )? The answer is that it depends on the type of distri- 
bution E. 

One extreme: suppose E is integrable, that is X^> = X. There is a foliation of Q by 
m-dimensional manifolds whose tangent spaces are the subspaces E q . Then it is clear 
that there are no further conditions. We can change the complement F = E 1 - without 
any restriction, and the metric there. In fact, we can fix a leaf S and the Levi-Civita 
connection on S will coincide with the projected connection, no matter what g is outside 
E. 

The other extreme is the case studied in pQ): 

Definition 6.1. We say that the distribution E is of the strongly non-holonomic type if 
the derived Pfaflian system associated to E is zero. 

We now prove 

Theorem 6.2. (Cartan, [TJ, §5.) In the strongly nonholomic case, the metrics g and g 
must have the same complementary subspaces. In other words: B = 0. Thus 

F = E ± 

is intrinsecally defined. 

Proof. Cartan used an argument that we found not so easy to decipher (see (j4ip on 
section 16721 below) . Thus we prefer to use a different argument to show that B = 0. We 
start with the structure equation 

dijji = —uJij A ou j — uj ia A uJ a 

Since Uij ~£ uJy and Uj ~# oJj (see Equation (123]) with C = I) this implies 

(36) dUi ~£ duji 
Now Equation (|39|) yields 

doJi = dui + dB iX u x + B iX du x 
and this inocently looking expression, together with (I3"B"|) yields 

(37) B; t \du)\ ~b or equivalently Bi\duj\ G X 
Hence if the distribution is of strongly non-holonomic type then B = 0. 

6.2. Digression. The following calculations are actually never explicitly written in 
[TJ, it seems that Cartan does something equivalent to them mentally. A caveat: the 
connection forms uu are antisymmetric in the indices /, J but in general this will not 
be the case for the forms Uu below. If desired, they will have to be antisymmetrized (a 
posteriori) . 

We begin by differentiating 

. ( C B \ , ( dC dB\ 
d0= \Q A ) du+ [o dA) U 

j_ \ ( C B \ , , ( dC dB\ \ ( C- 1 -C- l BA- 1 \ _ 
d °= A H"")+ dA A" 1 
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C B \ , s ( C~ l -C- X BA- 1 



-uu) = I o A ) (""/JO ^ o A" 1 ) 

(38) + ^J^o a- 1 

(to be antisymmetrized) 

The block (— tJy) is given by 



-CiuifiC- 1 + B(u a ,i)C- 1 + (ICC- 1 



We can take C = const. = id since we are not changing the subspace E. In this case 

(39) uJi = bJi + B iX u x 
and 

(40) (-wy) = (-Uij) + B(u; Qi ) 
From equation (14"U)) . Cartan observed: 

Proposition 6.3. T7ie condition ZJij ~£ ofy- zs equivalent to 

(41) BKf) ~ s . 

Cartan showed that under the hypothesis of the derived system being zero ( 14T1) implies 
B = 0. This follows from applying matrix £? to the structure equations 

(42) du a = —u ai oj a + modX , a — m + 1, • • • , n . 
Actually Cartan gave the expression ([I], section §4) like 

(43) doj a = Cij a UiUj + modX 
from which ( 1371) gives 

(44) B iX c jkX = , 
which is assumed to have only the trivial solution^. 

6.3. Equivalence problem for non-holonomic geometry. The method of equiv- 
alence is advertised by Cartan in the 1928 address^, but interestingly, he did not apply 
the method to its full power. We now outline the equivalence problem. 
Recall that for general distributions ( 157)) leads to the condition 

B lX u x e J (1) . 

The derivation was done in the particular case where C = id. But this is not a restric- 
tion. Replacing = (e$)C by the e« does not change the non-holonomic geometry and 
leads to the transition matrix 

(45) g=( I Q ° A 1B \ , CeO(m),AeGL(n-m). 



2 "Nous allons, das ce qui suit, nous borner au cas ou les equations homogenes ^2 x c jkx u \ — aux 
n — m inconnues u m +i, ■ • • , u n n'admettent que la solution u a = 0. Cela revient a dire que le systeme 
derive se reduit a zero" ([T], section §5). 

3 "La recherche des invariants d'un systeme de d'expressions de Pfaff vis-a-vis d'un certain groupe de 
substituitions lineaires effectuees sur ces expressions". (pQ, section 4). 
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Then (cjj) = (ui) + C 1 B(u a ) and (f57j) becomes 

C^S^) G X {1) 

and C _1 can be removed because X^ is a module over the functions on Q. 

In spite of Cartan's caveat@, we hope to raise interest in further research on the Equiv- 
alence problem for non-holonomic geometry: 

Given coframes (fi)v = (^i,^«)v an d (^)w = (ujjOJp)^ on open sets U and V, find 
invariants characterizing the existence of a diffeomorphism F: U — > V satisfying F*Qy — 
g ■ u>u, where the substitutions are of the form 

(46) 3= ( o" a) ' Ce °( m ) > AeGL(n-m) 
with 

(47) B(u a ) G X« = D(l) , X = [up] . 

We recall that X C A 1 (T*Q) is the annihilator of E. The greek indices can be further 
decomposed into two parts: 

capital greek $ = 1, • ■ ■ , r representing forms u<$, G X^; lower case greek letters 
a — m + r + 1, • • • , n , where r = dimX^ , < r < n — m . 

Matrix B can be written B = [Bx^B^) where the first ismxr and the second is m x 
(n — m — r). Condition fl37|) is equivalent to B2 = and our choice of basis implies that 

du$ involve only the u;<j>'s and the a; a 's; 
du a involve at least one of the U{. 

The group of substitutions consist of matrices of the form 
C Bx 

(48) g = I A I , C G 0(m) , A Q G GL(r) , A 2 G GL{n - m - r) . 




A x 

In terms of frames we have 

(ei) = (e*)C 

(49) (e A ) = (ei)Bi + (e*)^ + (e a ) Ax 

(e a ) = (e/3)A 2 

which in particular shows: 

Theorem 6.4. With the above notations, we have: 

i) (ej,e Q ) generate an intrinsic subspace [X^]- 1 , annihilated byX^ l \ 

ii) The e a generate an intrinsic orthogonal complement F of E in [X^]- 1 . 

Hi) There is complete freedom to choose the to complete the full frame for T q Q . 



4 "Si le systeme derive n'est pas identiquement nul, le probleme de la representation geometrique du 
systeme materiel devient plus complique. On est oblige de distinguer different cas, dans chacun desquels, 
par des conventions plus ou moins artificielles, on peut arriver a trouver un schema geometrique approprie. 
Nous n'entreprendrons pas cette etude generale, dont l'interet geometrique sevanouirait rapidement a 
mesure que les cas envisages deviendraient plus compliques" (pQ, §11). 
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7. NON-HOLONOMIC TORSIONS AND CURVATURES. 

In this section we assume the strongly non-holonomic case. Since B = (and as we 
can take C = id) we have 

(50) uJi = uji (ie., ~ TQ ) for % = 1, ...,m 
We look at the original structure equations for the uf. 

(51) dbJi = (duJi) = —UJijUJj — U ia U a 

where u^ = —Uji and we expand certain combination of the coframe basis Uj, up 

(at this point there is still freedom to choose the matrix A defining the cJg). The result 
is of the form 

dui = -UijUJj + 'jkXiUJkUJx + sx^xuj^ 
We now use to our advantage the condition ujij ~# aJjj of Proposition 14. 11 We can modify 

(52) -)■ uJij = u^ + pijxux 
with pijx = -Pjix so 

lk\i -> lk\i — IkXi + PikX 
liXk -> IkXi — liXk + PkiX = liXk - PikX 

There is a unique choice of p's making the 7's symmetrical, namely 

(53) pi k x = 

Summarizing, we have the Cartan structure equations for strongly non-holonomic con- 
nections: 

Theorem 7.1. (Cartan, p], §6.) Consider the non-holonomic connection D with connec- 
tion forms (0) modified as in / T5^) . Then D and D have the same geodesies and 

(54) dui = -UijUj + ^ kXi u k u x + sx^uxOf, 

(55) du a = CijaUi A Uj + mod X 

The forms Uij = —Uji are uniquely defined by the symmetry requirement 

( 56 ) liXk = IkXi- 

Cartan did not invest on computing curvatures^. The curvature forms for the connection 
Uij would be helpful to compute characteristic classes of the bundle E — >• Q. 



En meme temps qu'une torsion, le developpement comporte une courbure, dont il est inutile d'e'erire 
l'expression analytique" (Cartan, [T], §8). We take Cartan's words as dogma, perhaps to be subverted in 
future work. 
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7.1. A canonical choice of metric in F = E x . Assuming the strongly non- 
holonomic hypothesis, flH|) yields for each pair of indices j ^ k, 

(57) CjkaUa = =>• U a = . 

Interpreted as a linear system for the u a , this in particular implies 

-m(m — 1) > n — m or m(m + 1) > 2n . 

We now work on the change of coframes 

(58) ul a = A aX uj\, a = m + 1, ...,n. 
The differentials of the latter are given by: 

duJ a = A a \du\ + mod I 
du a = A aX (-u X iUJi - uxpup) + mod X 

Now, 

(59) u)\i = Cij\ujj + mod X 
so that 

(60) du a = CijaZJi A ujj + mod X 
with 

(61) Cij a = A a \Cij\ . 

We can choose matrix A uniquely by a Gram-Schmidt procedure on the n — m linearly 
independent vectors (c^x) A = m + 1, • • • , n in fljM™-- 1 )/ 2 ). 

Thus we obtain the conditions on the bivectors (Cartan's terminology): 

(62) 2^ CjjcfiijP — ■ 

From this point on, in order to maintain the ortonormality conditions, the change of 
coframes must be restricted to A e 0(n — m). Hence we get 

Theorem 7.2. (Cartan, [TJ, §9). Assume the strongly non-holonomic case. The condi- 
tions ( EIP define uniquely a metric g on TQ = E © F . 

7.2. Geometric interpretation of torsion. Recall the M m = E qo valued 1-form 6 
given by (fH|) 

e = "d 7 " = ujj Uj 

which is the integrand of fll3p . The quotes indicate that this is a loose notation, u dj" is 
not exact. Indeed, we compute 

d6 = duik Uk + duj A Uj . 
Now, duj = —Ukj Uk by construction, so by Proposition 17.11 

(63) d6 = 22, (duik + 22 Uk i A u ^> Uk = tkUk ■ 

k j 

In the strongly non-holonomic case, Theorem 17.11 gives 

(64) dd = (jjxkUJjUJx + sxfik^x^n) u k . 



14 
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Proposition 7.3. Cartan, pQ, §8). Consider an infinitesimal parallelogram in Q spanned 
by vectors u,v in T q Q, and the associated infinitesimal variation du(u,v) in R m . Ifu,v 
belong to E q there is no variation in R m after the cycle. For u G E q and v G E q = F q 
the variation is given by the torsion coefficients jk\i 's- For u,v G F q the variation is 
determined by the coefficients s\ a i 's. 

The symmetry l[5b)) has the following interpretation: 

(65) du(u, n) ■ v = dcu(v, n) ■ u 
with u, v G E q , n G F q . 

One can consider the non-holonomic connection on F associated to the metric ~g. More- 
over, one can repeat the procedure in Theorem 17.11 Write 

(66) duJ a = u\ a uj\ + 5 kXa u k ux + CijaUiUj 

where the ambiguity on the <5's can be removed by changing to another uJ\ a = u\ a + 
mod[a>i] and imposing the symmetry 5k\ a = &ka\ and the antisymmetry TJ\ a = —u a \. 

Mutatis mutandis, the geometric interpretation of the torsion coefficients 5k a \'s and 
Cija's is analogous. In particular, there is no torsion for pairs u,v G F. 

It seems that these geometric interpretations were forgotten by the geometers from the 
60's on. For instance, in the very influential lectures [9J, it is written (p. 59): "as far as 
we know, there is no nice motivation for the word torsion" . 

7.3. The case where F is integrable. When the torsion coefficients in (j54p all 
vanish, duj = ZJijUi, then all the forms duj belong to the ideal 

(67) J = [wi, ...,u m ) 

so by Frobenius theorem, the distribution F = J L is integrable. 

One can construct a local fibration hi — > B, whose fibers are (pieces of) F leaves. 
Choose coordinates (qx, ■ ■ ■ ,q m , q m +x, Qn) on U, such that (qx, • • ■ , q m ) are coordinates on 
B and the fibration is {qx, - " > Qm, Qm+i, Qn) (qi, ' 4 4 , Qm)- The distribution E will be 
given by 

(68) dq a = b ai dqi 

If the functions b a i do not depend on the last m — n coordinates, we have locally an 
M m_n action on U — > B and a connection on this (local) principal bundle. More generally, 
one can formulate the following equivalence problems: 

(1) Given (ui, ■ ■ ■ ,u n ) a coframe on U, find a Lie group G of dimension n — m, a 
diffeomorphism F: U ^ P = B x Q and a connection on the principal bundle P 
such that the distribution E : u a = on U corresponds to the horizontal spaces of 
the connection on P. 

(2) Same, adding the requirement that the vertical spaces b x Q correspond to the 
leaves of F. 

(3) Same, adding a Riemannian metric g on hi and requiring that it corresponds to a 
G-equivariant metric on P. 

(4) Same, requiring that the vertical and horizontal spaces are ^-orthogonal. 

The case 3) was considered in [llj. The non-holonomic geodesic equations on E reduce 
to a certain non-holonomic connection on B. If one prefers to use the Levi-Civita D B 
riemannian connection on B, relative to the projected metric, one gets an equation of the 
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form D^b = K(b) ■ b where K is antisymmetric so that the force in the right hand side is 
gyroscopic (does not produce work). 

This force vanishes in case 4). This seems to be what Cartan had in mind in the abelian 
cas^]. 

8. Restricted connections 

In this section we adopt an "internal" point of view, as opposed to the "extrinsic" 
approach of thepreceding ones. It is quite fragmentary and tentative, aiming to propose 
directions for future work. We change the notation for the configuration space, which will 
be denoted M. 

Consider a subbundle E — > M of TM, and a vector bundle H — > M. 

Definition 8.1. An E-connection D on H is an operator Dxs for X section of E and s 
section of H, satisfying: 

• D is M-linear in X and s and C 00 (M)-linear in X. 

• D is Leibnitzian in s: 

D x fs = X(f)s + fD x s . 

To emphasize the fact that X 6 E, we also call this object an E- restricted connection. 
When H = E an E-restricted connection on E will be called a non-holonomic connection 
on E. 

A comment is in order. This definition seems natural here but we have searched the 
literature and have not found it. In fact, given a vector bundle H — > M, the usual notion 
of a connection D on H (see e.g., Milnor jl], appendix C) means a TM-connection on 
H, in the sense of our Definition 18.11 We will call those full connections. That is, X is 
allowed to be any section of TM, so one is able to covariantly differentiate allong any 
curve c(t) in M. The difference in Def 18.11 is that the covariant differention is defined 
just for curves with c G E. Therefore, to avoid confusion, we called the connection in 
Definition 18.11 a restricted connection. 

Given a full connection, evidently, it can the restricted to E or F . Given a (restricted) 
E-connection on H, can it always be extended to a (full) TM-connection? The answer is 
yes. Consider the following "cut-and-paste" or "genetic engineering" operations: 

Consider a Whitney sum decomposition 

(69) TM = E®F 

with protection operators denotes by P (over E parallel to F) and Q (over F parallel to 
E). 

i) Given a (full) connection DxY on TM, it induces full connections D 1 on E and D 2 on 
F, by restricting Y to one of the factors (say, E) and projecting the covariant derivative 
D X Y over this factor. Since full connections are plentiful, so are restricted ones. 

ii) Given D X ,D 2 E-restricted (F- restricted, respectively) connections on H, it is obvious 
that Dxs = D Xi s + D Xi s defines a full connection on H. 



Si alors dans l'expression de la force vive du systeme on tient compte des equations des liaisons, 
on obtient vme forme quadratique en q[, ■ ■ ■ ,q' m , avec des coefficients fonctions de ?i, • • • , q m - On peut 
appliquer les equations de Lagrange ordinnaires." ([1], §10). 
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Proposition 8.2. Given a non-holonomic connection D^ E ' E ^ on E, and D^ F > E ' an F- 
connection on E, the rule 

(70) D X Y = Df x E) Y + D%f>Y , Y £ T(E) 

defines a TM connection on E extending D. Here P and Q are respectively the projections 
on E (resp. F) along F (resp. E). 

Remark 8.3. However, given an E-restricted connection D 1 in E and an F-restricted 
connection D 2 in F, the rule 

D X Y = D X Y X + D\ 2 Y 2 

fails to define a connection in TM, because 

X 1 (f)Y l + X 2 (f)Y 2 ^X{f)Y. 

The equivalence problem can be rephrased as follows: characterize the class of full 
connections D on M such that their non-holonomic restrictions D E = D^- E,E ^ have the 
same geodesies. 



8.1. Parallel transport and geodesies. The basic facts about TM-connections (see 
Hicks, [9], chapter 5) hold also for E-restricted connections. For instance, 

• (DxY) m depends only on the values of Y £ H along any curve c(t) £ M with 
c(0) = X m . 

• Parallel transport of a vector h Q £ H m along a curve c(t) £ M with c £ E. 

We slightly change the usual proofs ([§]). Take a local basis {hj}, j = 1, ■ ■ • ,p trivial- 
izing H over a neighborhood U C M and vectorfields e 1 , • • • , e q on U C M generating 
E. Here q is the dimension of the fiber of E and p the dimension of the fiber of H. We 
define p 2 q functions T l - k (1 < i, j < p, 1 < k < q) on U by 

(71) DeA—E 1 ^ • 

i 

Write 

c{t) = ^g k (t)e k . 

k 

We search a,j(t) such that h(t) = J2 a j(t)hj satisfies 

D d h(c(t)) = 0. 
We get a linear system of ODEs in p-dimensions 

(72) ^+J2 a Mm k (t) = o. 

i,k 

where r j ih (t) = T\ k (c(t)) . 

Recall that an i?-connection on itself (that is, H = E) is called a non-holonomic 
connection on E. The equation D d c = gives a nonlinear system in n + p dimensions 
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(where n is the dimension of M and p = q is the dimension of E) for x and a given by 

= a k4(x) (l<r <n), 



dt 

k=l,...,p 



1,(1 ' - - a iak ri k (x) = o(i<j< P ). 



(73) 

dt 

i,k=l,...,p 

Here e k r is the r — component (1 < r < n) of the fc-th E'-basis vector (1 < k < p) in terms 
of a standard trivialization of TM. 

8.2. Torsion and curvature. Let D, D two (full) TM-connections, and D E , D their 
restrictions as E-connections along F. Consider the difference tensor on TM 

B { X, Y) = D X Y - D X Y , X, Y G T(TM) . 

Clearly B is C°°(M)-linear in both slots. Decompose B = S + A into symmetric and 
skew-symmetric pieces: 

(74) S(X, Y) = l[B{X, Y) + B(Y, X)} , A(X, Y) = l - [B{X, Y) - B(Y, X)} 
Consider also the torsions 

(75) T D (X, Y) = D X Y - D Y X - [X, Y] , Y) = D X Y - D Y X - [X, Y] . 
It is easy to verify 

(76) 2A(X, Y) = T(X, Y) - T(X, Y) . 

These objects clearly make sense in the restricted version. Recall ([9], 6.5) the notion 
of torsion associated to a (1, 1) tensor P (m G M P m G End{T m M)): 

(77) T P (X, Y) = D X P(Y) - D Y P(X) - P[X, Y] . 

Here we take for operator P the projection over E along F. In the context of restricted 
connections X, Y are vectorfields in E. 

Definition 8.4. Let B E the restriction of B to E, with values projected on E along F, 
and similarly define S E ,A E . Define the restricted torsion by 

(78) TE(X, Y) = D\Y - D§X - P[X, Y],X,Y G T(E) 
The latter is a E m valued tensor (v m , w m ) G E m x E m h-> T E (v m , w m ) 

Theorem 8.5. The following are equivalent: 

El 

a) D E and D have the same E-geodesics. 

b) B E (X,X) = for all X G T(E). 

c) S E = 0. 

d) B E = A E . 

Corollary 8.6. The restricted connections D E and D are equal if and only if they have 
the same geodesies and the same restricted torsion tensors. 

Corollary 8.7. Given a restricted connection D , there is a unique restricted connection 
D E having the same geodesies as D and zero restricted torsion. 



IS 
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Proof. The results on Hicks ([9], section 5.4) follow ipsis Uteris in the restricted context. 
For instance, we show the latter. The uniqueness results from the second proposition. To 
show the existence, we define 

(79) D X Y = D X Y- \t E {X,Y) . 

D E is clearly an i?-connection. We compute 

B E (X, Y) = -T(X, Y) = A E ,S E = 0, 

since the torsion is skew symmetric. Since S E = 0, they have the same geodesies. Finally, 
a simple calculation gives 

T E = T E - 2A E = 

so D E has zero torsion. 

In terms of the original full connection, there is still too much liberty. We can extend 
D E to a full connection with arbitrary completions D F,E , D™' F '. In the spirit of Cartan's 
approach, one would like to characterize special completions. We plan to pursue this 
futurely. 
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